Steps towards nonlinear cluster inversion through 
gravitational distortions: 

III. Including a redshift distribution of the sources 

Carolin Seitz & Peter Schneider 

Max-Planck-Institut fur Astrophysik, Postfach 1523, D-85740 Garching, Germany. 
Abstract 

In a series of previous papers we have considered the reconstruction of the surface mass 
density of a cluster of galaxies from images of lensed faint background galaxies. We 
showed that the reconstructed surface mass density is not uniquely determined, but that 
there exists a global invariance transformation that leaves the shape of the images of 
the lensed galaxies unchanged. Because of this, only lower limits on the total mass of 
a cluster can be derived if no further informations besides image ellipticities are used. 
Throughout these papers we used the simplifying assumption that all sources are at the 
same redshift. 

In this paper we account for a redshift distribution of the faint galaxies, and in par- 
ticular, some of these galaxies can lie in front of the cluster or can be cluster members. 
We show how the mass distribution of a cluster of galaxies can be obtained from images 
of these faint galaxies, if the redshift distribution of these galaxies is known. We demon- 
strate that for the reconstruction of non-critical clusters we need less information on the 
redshift distribution of the galaxies, i.e., we only need to know two or three moments of 
the distribution. 

We show that the mean mass density across the data field is still a free variable, 
i.e., there remains a global invariance transformation of the resulting mass density field. 
For non-critical clusters we can derive the transformation explicitly and it is similar to 
that derived previously for the case of a single redshift of the sources. We discuss several 
theoretical ideas to break the mass degeneracy; of those considered, we find that only 
the magnification effect on the number density of galaxy images can be used successfully 
in practice. 
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1 Introduction 



This is the third and final paper in a series in which we have considered the reconstruction 
of the surface mass density of galaxy clusters from the (weak) image distortion it imposes 
onto faint background galaxies. Inspired by the pioneering work of Kaiser & Squires 
(1993, hereafter KS), who derived a parameter-free inversion equation for the surface 
mass density of the cluster in terms of the tidal deflection field [which has been discovered 
earlier by Tyson, Valdes & Wenk (1990); earlier work on cluster mass determination by 
weak lensing effects include Kochanek (1990) and Miralda-Escude (1991)] we have started 
to generalize the KS inversion method to include also strong clusters, i.e., cluster which 
are - at least nearly - capable to produce giant arcs. In Paper I (Schneider & Seitz 
1995) we have analyzed the basic observable from image distortions and pointed out a 
global invariance transformation of the surface mass density which leaves the observable 
distortion invariant. In Paper II (Seitz & Schneider 1995) we have then developed an 
iterative procedure to reconstruct the density field, which was then applied to synthetic 
data and shown to work well. 

In the present paper, we want to generalize the treatment of Paper II in two different 
ways. First, the inversion procedure constructed in Paper II is not unbiased and contains 
boundary artefacts in the same way as the original KS method, due to the fact that the 
observations are always limited to a finite data field. There are now several different 
inversion methods which are unbiased (Schneider 1995; Kaiser et al. 1995; Bartelmann 
1995, Seitz & Schneider 1996) - all of them are based on a relation found by Kaiser (1995). 
As was demonstrated in Seitz & Schneider (1996), these finite-field methods work well 
and can be applied efficiently. Second, in the earlier papers cited above it was assumed 
that all sources have the same effective redshift; by that we mean that all sources have 
about the same ratio D(zd, z)/D(z) of the angular diameter distance as measured from 
the lens and from the observer. This assumption is fairly well justified if the cluster 
redshift is relatively small, z^ 0.2, say. For weak cluster lenses, this assumption can 
be easily dropped since then the (linear) reconstruction proceeds equivalently to the case 
that all sources are at about the 'mean redshift' of the population (this will be made 
more precise in Sect. 4.3 below). However, if the cluster is not assumed to be weak, the 
consideration of the source redshift distribution becomes more difficult. We shall discuss 
this problem in some detail below. 

The rest of the paper is organized as follows: in Sect. 2 we present basic equations 
and our notation. In Sect. 3 we derive the dependence of the local observables on the 
local values of the surface mass density and the shear. In Sect. 4, we describe the non- 
linear reconstruction method; after briefly reviewing the unbiased finite-field inversion 
method developed by Seitz & Schneider (1996) for the case of a single source redshift, 
we generalize this technique to the case of a redshift distribution. As in the case of a 
single source redshift, we also have a global invariance transformation, which, however, 
cannot be written in closed form in general. However, restricting the consideration to 
moderately strong clusters, we construct the invariance transformation explicitly. As we 
show in Sect. 5 and in Appendix 3, the presence of a redshift distribution of the sources 
provides several methods to break the invariance transformation. 

However, from simulations we found that among the methods considered, only that is 
successful in practice which makes use of the magnification effect on the number density of 
galaxy images (see also Broadhurst, Taylor & Peacock 1995). In contrast to Broadhurst, 
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Taylor & Peacock (1995) and Broadhurst (1995) we do not use the local magnification 
to derive the local mass density, but we use the magnification averaged over the field to 
constrain the mean mass density in the field (in order to combine information from the 
shear with local magnification information, a maximum likelihood approach seems to be 
the best strategy - see Bartelmann et al. 1996). Essential for the success of this method 
is that the source counts deviate sufficiently strongly from the n(S) oc S~ 2 behaviour, 
which seems to be the case for galaxies with red colour. Furthermore, the method 
depends strongly on the assumption that the faint background galaxies are distributed 
rather smoothly, i.e., that no strong correlations in their angular position is present, as 
seems to be justified by recent investigations (e.g., Infante & Pritchet 1995). 

We apply our methods to synthetic data in Sect. 6 to demonstrate its feasibility. The 
application to an HST exposure of the cluster C10939+4713 will be published elsewhere 
(Seitz et al. 1995c). We discuss our results in Sect. 7. 



2 Basic equations 



2.1 Redshift dependence of the lens equation 

The distortion of images of background galaxies depends on the dimensionless surface 
mass density of the lens, which is the physical surface mass density S(0) divided by 
the critical surface mass density S C rit- If we now consider the background sources to be 
distributed in redshift, then the critical surface mass density depends on the redshift z 
of the source: 

{oo for z< Zd, 
for (2.1) 
47rGD(z d )D(z,z d ) Z> ^ 

Here D(z d ) and D(z) are the angular diameter-distances from the observer to the lens at 
redshift z d and to the source at redshift z, and D(z d , z) is the angular diameter-distance 
from the lens to the source. Defining 

w ( z - z ) — nm z^oo ^crit(z) _ ^critpp ^ 2) 

^crit 

(z) >;«.,.•„ ( ■- ) 

we obtain for the dimensionless surface mass density k(0, z) at angular position 6 for a 
source at redshift z 

= = y^y^s " K{6) w ^ z ^ ■ (2 - 3) 

The function w(z; z d ) relates the 'lensing strength' for a source with redshift z to that of 
a hypothetical source at 'infinite redshift', and its form depends on the geometry of the 
universe. For an Einstein-de Sitter universe we have 

{0 for z< Zd 
v/TT^-v/TTid" c ( 2 - 4 ) 
. for z > Zd ■ 
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In particular, for sources with redshift smaller than that of the lens, the 'lensing strength' 
vanishes. For the rest of this paper, we consider a single cluster lens at redshift z^, and 
drop the second argument of w, i.e., w(z;z^) = w(z). 

Since the shear 7 is related linearly to the surface mass density, its dependence on 
source redshift is the same as for k, 

7 (0,z) = 7i(0,z) + i 72 (0,z) = - / d 2 e'v(e-e')K(d',z) = w(z) 1 (e) , (2.5) 

where the complex kernel is X>(x) = (x\ — x\ + c X\x\x<i)j |x| 4 . Because of the relations 
(2.3) & (2.5) it is sufficient to consider the dimensionless surface mass density and shear 
with respect to one particular source redshift, which is at infinity for our choice. 
The lens equation for a source with redshift z is 

/3 = 0-W>(M , (2.6) 

where ift(0,z) = w(z)ip(0) is the deflection potential at for a source at redshift z. 
Hence, the linearized lens equation at which describes the image distortion of small 
sources is given by 

d P T „ j„\ ( ^12 



where X is the two-dimensional unit matrix and ipij = d9 .g e . ■ The magnification /u(0) of 
an image with position and a source redshift z then becomes 

M<M := rrr ) ln = ~ ■ (2-8) 



|det A(0,z 



[1-w(z)k(0)] 2 -w 2 (z)\ 1 (6) 



The cluster is non-critical for sources at redshift z if det A(0, z) > everywhere; it is 
non-critical for all source redshifts if [1 - k(0)} 2 - | 7 (0)| 2 > for all 0. 



3 Local observables and their dependence on local lens 
parameters k and 7 

Throughout this section we assume that these lensing parameters can be considered as 
constant over a small solid angle around 0, and we will therefore suppress the argument 
in all equations. 

In Papers I & II we described the shape of an image by the complex number 

Q11 - Q22 + 2iQi 2 
X = n n , (3-1) 

where Qij are the components of the tensor of second moments of the surface brightness 
of the image. The same relation was used to define the source ellipticity x s in terms of 
the second brightness moments Q s of the source. An image with elliptical isophotes has 
|x| = (1 — r 2 )/(l + r 2 ), if r G [0, 1] is the axis ratio of the ellipse. Here, we define 

V (3.2) 



i + \/i-M 2 
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as the ellipticity parameter; for an image with elliptical isophotes, |e| = (1 — r)/(l + r), 
and the phase of e is the same as that of x- Defining 

g(z) = = , (3.3) 

1 — k(z) 1 — W{Z)K 

the transformation between source ellipticity Xs and image ellipticity x f° r a source with 
redshift z has been shown in Paper I to be 

Y= xs-2^) + oj 2 (^)x: , 34o) 

l + b(z)| 2 -27Je[(/(z)x:] ' 

where an asterisk denotes complex conjugation. From this transformation we find with 
(3.2) the transformation between the ellipticity parameter e of an image and that of the 
corresponding source e s : 

^s-g(z) . . 

for |^(z)| < 1 , 



e s -f *{z) 

The condition |<7(z)| < 1 (|<7(z)| > 1) is equivalent to the condition det A(z) > 
(det A(z) < 0). 

Now, let Pe s (y) y dy dip be the probability that the source ellipticity e s = y exp(2i</?) 
is within y dy dip around e s . Then, for fixed redshift z, the expectation value of the n-th 
moment e n is given through 

r i /-27T ( [~g(z)] n for \g(z)\ < 1 , 

^(^I^pmI = {(_zL)" for | 9W1>1 . 

The remarkable fact that the expectation values (e n ) £s do not depend on the source 
ellipticity distribution, whereas (x) x does (Papers I & II), is the reason for choosing e as 
the ellipticity parameter in this paper. We derive (3.5) in the Appendix 1. 

If the galaxies are distributed in redshift according to the probability density p z (z) , 
the expectation values of the moments e n become: 



POO 

<O es ,^ / dz Pz (z) (e"> £ » 
Jo 

= [ dz p z (z)[-g(z)] n + I dzp z (z) (-^-) 

JdetA(z)>0 JdetA(z)<0 \9 \ z ) J 



-W \ I 1 \ / , . s / 1 — KW 



= 7 n / dwp w (w)[- + — / dwp w (w) 



jtlcv — ^ V "1 / V * / / i uj \ — / i 

detA(w)>0 \l — KWy \7 J ,/detA(™)<0 V ~ w 

= 7 n X n ( K , 7 ) + ^Y n ( K , 7) = 7 n ^X n («, 7 ) + ^p^n(«, 7) 

(3.6) 

In the third line we used the transformation of the source redshift distribution into their 
^-distribution, which consists of a delta 'function' at w = for galaxies with z < za and 
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is given by p w (w) dw = p z (z) dz for galaxies with z > Zd- For a single redshift zq of the 
sources, the expectation values reduce to (e n ) £s z = (e n ) £s {zq) given in (3.5). 

Generally, the boundaries of the integrals in (3.6), and therefore X n and Y n , depend 
on k and 7, 



{I 



min(l,l/[«+| 7 |]) r l \ f _ w N n 



X„(«,7)= / +/ dtyp w (ty) 



<(!.« — ItI) 



1 . . 

/"max(l, K -| 7 |) / 1 - \ 

>n(«,7) = / dtwp„,(iw) 

^min(l,l/[«+| 7 |l) V ~ w J 



n 



1 — KW 

(3.7) 



One sees that for some values of the parameters k and 7 there exist two intervals of w 
for which det A(w) > 0, seperated by that interval for which det A(w) < 0. 

In the case of weak lensing we obtain from (3.6) that (e n ) £ z ~ (— l) n (w n ) 7 n , and 
the shear is - modulo (w n ) - an observable. 

An estimate of these expectation values for the moments of the image ellipticities 
can be obtained by considering (locally) an ensemble of sources with ellipticities and 
defining the means 

where u k is an appropriately chosen weight factor (see Paper II or Seitz & Schneider 
1996). These mean values e n are statistically distributed around the expectation values 
(e n L ,i and we use them as an estimate for (e n ) c „. 

To summarize, the mean image ellipticity and their higher moments (e n ) e do no 
longer directly provide us with a local estimate for 7/(1 — k) or its inverse as in the case 
where all sources are at the same redshift, which was assumed in previous papers. In fact, 
the dependence of (e) on the local lens parameters k and 7 can be quite complicated, 
depending on the redshift distribution of the sources and on the local lens parameters 
itself. 

To obtain information on the local lens parameters k, 7 from e ~ (e) z we have to 
know the redshift distribution p z (z) of the galaxy population which enters Eq. (3.6). We 
point out that the average in (3.6) extends over all galaxies, i.e., also over those situated 
in front of the lens, which, however, do not provide any information on the local lens 
parameters, since (e n ) £s (z) = for z < Zd, but contribute to the noise which is inherent 
in the estimate of the local lens parameters derived from e. In the rest of this paper we 
assume that this redshift distribution of galaxies is known or at least can be estimated, 
e.g., by the lensing effect itself (see Bartelmann & Narayan 1995, Kneib et al. 1995). 

For illustration we assume that the redshift distribution is given by a function of the 

form 

p * iz) = r(3/p)z$ «p (~ (*/*>)') » ( 3 - 9 ) 

taken from Brainerd et al. (1995), with moments (z) = 3zo for (3 = 1, (z) = 1.5 zq for 
(3 = 1.5 and (z) = T(4/3) z for (3 = 3. 

In Fig. 1 we show this distribution p(z) for (3 = 1 = (z) and the corresponding 
distribution p w (w) for a lens with redshift Zd = 0.4. 
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Fig. 1. The lower panel (solid line) shows the redshift distribution given in Eq. (3.9) for (z) = 3zo = 1 
and (3 = 1. The dashed line shows the function w(z) defined in Eq. (2.4) for a cluster redshift of = 0.4. 
The upper panel shows the corresponding distribution p w (w). Note that p w (w) has a delta-'function' 
peak at w = 0, with an amplitude given by the probability that a source has a redshift smaller than that 
of the lens 



4 The reconstruction of the cluster mass distribution 



4.1 Inversion relations for a single source redshift 

The reconstruction of the surface mass density of a cluster from lensed images of back- 
ground sources has been described in several papers (see introduction) for sources at 
the same redshift z. Let n(0) be the dimensionless surface mass density of the cluster, 
scaled by the appropriate critical surface mass density, and let 'y(O) be the corresponding 
complex shear. Both quantities are given as second partial derivatives of the deflection 
potential, and it was shown by Kaiser (1995) that the following relation between third 
partial derivatives of the deflection potential, or first partial derivatives of k and 7, is 
valid: 

Vn(e) = -(^ + ^A^V(6) (4.1) 

\72,i - 71,2 / 

Hence, the surface mass density can be obtained in terms of the shear by integrating 
this first-order differential equation. However, the shear is not an observable in general 
(Kochanek 1990; Paper I), but the quantity g(0) = 7(0)/[l-«(0)], if we confine our con- 

siderations to non-critical clusters. (For critical clusters only 2g/ (1 + \g\ ) is observable.) 
Inserting 7 = g (1 — k) into (4.1) yields (Kaiser 1995) 
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VK(0) = - \—J l + gi / 2 )( 9l ' 1+9 *A= u{0 ) , (4.2) 

l-9i-9 2 \ 92 I ~ 9i J \92,i ~ 9i,2 J 

where 

K(6) :=ln[l-«(0)] . (4.3) 

Hence, it is possible to derive the gradient of the quantity if in terms of the observable 
quantity g. Obviously, the integration of (4.2) allows an arbitrary integration constant, 
i.e., K(6) can only be determined up to an additive constant. Several methods exist to 
perform an integration of (4.2); see Schneider (1995), Kaiser et al. (1995), Bartelmann 
(1995), and Seitz & Schneider (1996). All of them would be equivalent if u(0) were a 
gradient vector field. However, since u is determined observationally, it is not exact and 
thus in general not rotation-free. The integration derived in Seitz & Schneider (1996), 

K{6) = [ dV H(0';0) -u(0') +K , (4.4) 
Ju 

was explicitly constructed to account for this 'rotational noise component' and has been 
demonstrated in Seitz & Schneider (1996) to work better than the other proposed meth- 
ods. Here, H(0'; 0) is a vector field explicitly constructed in Seitz & Schneider (1996), 
and K is the average of K(0) over the data field U, i.e., the region where image elliptici- 
ties have been measured. Of course, K is an undetermined constant, so that the surface 
mass density k(0) is determined up to the transformation 

k(0) -> (1 - A) + Ak(0) . (4.5) 



4.2 General inversion method 

In the general case of a redshift distribution of the sources, we again make use of (4.1). 
The formal integration of (4.1) proceeds in the same way as that of (4.2), i.e., 

K (0) = [ d 2 6' H(0';0) -U(0') + R , (4.6) 
Ju 

where R is the average of k over the data field U. The vector field U(0) is defined in 
(4.1) and given by first partial derivatives of the shear. The shear in turn is related to 
the mean image ellipticity via (3.6), 

(6) w = 7^i(«,7) + ^i(«,7)j«e . (4.7) 

Note that Eq. (4.7) is a local relation, valid at every point 0. The complicated dependence 
of (e) e z on n and 7 suggests an iterative approach for the solution of the inversion 

problem: let e be an 'measured' estimate for (e) z and let 7(^(0) and «( n )(0) be an 
estimate for the shear field and the surface mass density. From that, an updated estimate 
for the shear field can be obtained, using (4.7): 

7 (»+i)(0) = e 



x 1 (^)(0), 7 ^(0)) + yi( ^ )(g) ' 7( : )(0)) 



(4.8) 
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Then, by differentiation, the vector field U (n+1 - ) (0) can be calculated from (4.1), by using 
the shear field 7 (n+1 )(0). And finally, an updated estimate for the surface mass density 
field is obtained from (4.6), 

K («+i)(0) = / dVH(0';0) -U (n+1) (0') + « • (4-9) 



This iteration process is started by chosing k^°\6) = 0, <y(°\0) = 0. It is clear that the 
integration constant R is still a free variable, i.e., with the method described here there 
remains a global invariance transformation of the resulting surface mass density field; 
in contrast to the case considered in Sect. 4.1, this transformation cannot be explicitly 
determined, due to the highly nonlinear relations occurring here. For critical clusters we 
need at most 10 steps to achieve a convergence of the iteration algorithm. For less massive 
clusters about 5 iteration steps are sufficient. We find that the iteration algorithm is more 
stable and converges faster than in the case of a single source redshift (see Paper II), 
mainly because there are no well-defined critical curves as function of 0, since their 
location depends on the source redshift. 

4.3 The case for non-critical clusters 

If the cluster is non-critical for all redshifts of the sources, the inversion problem can be 
simplified because Y h (k, 7) = for all n - see (3.7) - and the X n depend only on k, 

X u (k)= [ dwp w (w)(—^-) . (4.10) 

Jo \1- KWj 

Then, (3.6) simplifies to 

(Oe s ,,=7 n *n(«). (4-11) 

As a result, the iteration procedure described in the last subsection can be applied in 
a somewhat simpler way, by using (4.11) instead of (4.7); in addition, one can use the 
approximation (A2.4) derived in the Appendix 2 for X\(k) which yields 



-(w) 

(w) 



« : — -T^) ■ (4-12) 

1 — K- L 



This approximation is found to be sufficiently accurate to describe X\{k) for (generic) 
non-critical clusters (see Fig. 8 in Appendix 2). 

Combining (4.11) for n = 1, (4.12), and replacing the expectation value (e) z with 
the observed local average e, we obtain 

7 =--£- (!-/<«,)«) , (4.13) 
{w) 

with the definition 

(4-14) 

(w) 

Inserting this expression for 7 into (4.1), one obtains after some manipulations 

VVK=±(l-f(w) K )(l 1 > 1 + l 2 > 2 ) , (4.15) 
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with the matrix 



Thus, 



with the inverse 



If we now define 



V = 



1 + fei fe 2 
fe 2 1 - /ei 



Vk -^-(1 f(w) K ) I' ; (' A • J> 2 ) . (1.16) 

\W) \ e 2,l — £1,2 



\ 1 = - f 1 /V 1 - f( - ) (11 7) 

W 2 (e1 + e1) V -/c 2 l + /ci 1 



K(0):=]n(l-f(w) K (0)) , (4.18) 

(4.16) can be written as 

VK(0) = fV' 1 ( €l ' 1+e2 'A =u(6) . (4.19) 

V e 2,i- e i,2/ 

Thus, the vector field u(0) can be constructed directly from the observable e, as in the 
case of all sources being at the same redshift, and the same inversion equation (4.4) should 
be used, but with the current definitions of K and u. Thus, by using the approximation 
(4.12), the inversion of a non-critical cluster is no more complicated than in the case of a 
single source redshift. In particular, (4.18) immediately shows that K can be determined 
only up to an additive constant, which implies the invariance transformation 

«(*) -> + x <°) ; ( 4 - 2 °) 

in other words, (l — (w 2 ) k(0)/ («;)) can be determined only up to a multiplicative con- 
stant. 



5 Breaking the mass invariance 

In Sect. 4, we have shown that there exists an invariance transformation for the surface 
mass density map which leaves the observed local mean image ellipticities constant (4.5, 
4.9 & 4.20). The occurrence of the invariance can be traced back to the fact that the 
inversion equation was derived from a first-order differential equation - see (4.1) - which 
has a free integration constant. We now discuss several ideas to determine this constant. 

5.1 Using only shape information 



5.1.1 A maximum-likelihood approach. We consider a fixed set of 'observed' galaxies 
with ellipticities e« and positions 0j. Then we reconstruct the surface mass density of 
a cluster as described in Sect. 4.2 assuming a value R for the mean mass density in the 
field [see Eq. (4.9)]. From that we calculate the mass density n(6i) and shear 7(0*) at the 
positions 0* (i = 1, N). Next we calculate the probability p(e*, Av(0*), 7(0*)) to observe 
the image ellipticity e* at the galaxy position 0*. Finally we maximize the likelihood 
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C\ = IIip(ei, K(Oi), 7(0$)) using different vafues for R in the mass reconstruction. We find 
that the mass degeneracy is broken, but unfortunately only weakly, unless the redshift 
distribution is broad and the surface mass density is unrealistically high. From various 
simulations we conclude that the likelihood of observing a set of galaxy ellipticities e(0$) 
gives no significant limits on the mean mass density in the field. Furthermore, the 
likelihood depends much stronger on the assumed redshift distribution - which in practice 
is weakly constrained - than on the value of the mean mass density R and is therefore a 
much better tool to investigate the redshift distribution of the sources than to determine 
the mean surface mass density of a particular cluster. 



5.1.2 Second moments of image ellipticities. For non-critical clusters, the ratio 

2 

R = 



1-K^- 



(A ' X X {KY f (w 3 ) \ 



depends only on the local surface mass density k. We derive (5.1) in Appendix 3. Mea- 
suring R could therefore provide a direct estimate for k. However, we can not measure 
(e 2 ) e z and (e) e z accurately enough to determine the local mass density k or the average 
mass density R from a resonable number density of image ellipticities (see Appendix 3) . 

5.2 Magnification information through number density of images 

Let us assume that the number density uq(S, z) of the unlensed faint galaxies with flux 
S and redshift z is given through 

n (S,z)=p z (z)F(S) , (5.2) 

where p z (z) is the normalized redshift distribution and F(S) the distribution in flux. 
Note that the factorization of no(S, z) as given in (5.2) is a fairly special assumption for 
the galaxy distribution, which will not be valid in general. However, for simplicity we 
shall make use of (5.2) in this paper. This factorization may in fact be an approximate 
distribution over a limited range of flux S. Since the magnifications are not very large 
except perhaps in the very central parts of the cluster, the 'dynamic range' over which 
(5.2) is applied is not large and might be a valid description. It should be noted that 
this factorization has already been used implicitly in (3.6). If (5.2) is not assumed, then 
the redshift distribution of sources locally - where the surface mass density is k and the 
shear is 7 - will differ from p z (z) due to the magnification, so that the functions X n and 
Y n will attain an additional dependence on k and |7|. This will not cause any additional 
conceptual problem. 

Then, consider a position in the cluster with surface mass density k(0) and shear 
7(0). The observed number density of galaxies with redshift z and flux larger than S is 
(see Broadhurst et al. 1995) 

„(>S, i ,«)=p lW -jj- J F(>-^) , (5.3) 

with the magnification defined in (2.8) The total number density of galaxies observed 
with flux larger than S is obtained through integration of (5.3) and yields 
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If F(> S) oc S~ a , then we obtain from (5.4) 

/•OO 

n(> S, 6) = n (> S) / dz p z (z) ^~\6, z) = n (> S) {^~\0, z)> (5.5) 

./o 

From Eq. (5.5) we conclude that the number density is not changed if a = 1 and it is 
increased (decreased) for a > 1 (a < 1). Next, averaging (5.5) over the data field U with 
area U, we obtain 

(n(> S, 0)) u = ^l d2 ° <> S, 0) = n (> S) {^~\0, z)) gjU . (5.6) 

Hence, the ratio of the number of observed galaxies in the data field U to the num- 
ber U no(> £) which would be observed in the absence of a foreground lens gives 
( / u a_1 (0, z)) w . The number density no(> 5) of the unlensed galaxies is regarded as an 
universal function and has been measured in several colours down to very faint magni- 
tudes (see for example Smail et al. 1995). Because of this, the local observables in the 
case of cluster lensing are the moments of the image ellipticities (e n ) (6) and (/u(0, z)) z . 
We note that the result that (n(6,z)) z is a local observable is only true for the ansatz 
(5.2) with F(> S) oc 5 ,_a . If either the ansatz (5.2) does not hold, or if F(> S) is not a 
power law, then the observable quantity is a different one and can in particular have a 
much more complicated dependence on the magnification. However, n(> S, 0) can still 
be compared - may be not analytically - with no(> S) and provides information on the 
local magnification. 

In the following we consider a fixed value of the flux threshold S and use for muta- 
tional simplicity n(6) and no instead of n(> S, 6) and no(> S). 

5.2.1 Do galaxy positions break the mass invariance? The probability to observe a galaxy 
within the area d 2 6> around 6 is 

Using Eqs. (5.5) and (5.6) we obtain this probability in terms of the local (redshift- 
averaged) magnification 

The likelihood C for observing iV galaxies with positions 6i (i = 1, . . . , N) is 

£ = n, p{0i) . (5.9) 

On the other hand, we can perform the mass reconstruction according to Eq. (4.9) and 
obtain ( / u a_1 (0, z, k)} as a function of the assumed mean mass density R in the field. 
From that we can calculate with Eq. (5.8) the probability density p(0i) and finally ac- 
cording to (5.9) the likelihood C(k) for the assumed mean mass density. However this 
does not work. To understand this, assume for a moment that all galaxies are at the same 
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redshift; for simplicity we take this redshift to be 'infinity' here. Then the reconstructed 
mass density k is related to the true one K true through the invariance transformation 
(4.5) which gives (1 — Ktrue)A = 1 — k and 7t ru eA = 7. For the magnification we obtain 



£(e) = n a - 1 (p)\*- 2a , (5.10) 



and from that we conclude that p(0) is independent of A, or, equivalently, independent 
of R. This means that for all sources at the same redshift, the positions of the observed 
galaxies do not give any information on the mean mass density in the field. 

If the sources are distributed in redshift then the mean mass R assumed for the 
reconstruction changes the ratio 

«™= i£hksf& - (5 - n) 

However, the dependence on R is very weak as can be verified for non-critical clusters 
through calculating, for fixed k(0) and 7(0), the function (/i a_1 (0, z)) z (A) using the 
invariance transformation (4.20). As a result, the positions of the observed galaxies 
provide no or not enough information to break the mass degeneracy. 

Up to here all our attempts to break the mass degeneracy in practice have failed. 
This is because for all cases considered, the mass degeneracy is - in theory - broken 
because of the presence of a redshift distribution of the sources and is no longer broken 
if all sources are at the same redshift. We conclude that in order to derive R one has to 
use a method which breaks the mass degeneracy not because of the presence of a redshift 
distribution of the sources, but independent of that. 

5.2.2 Breaking the mass degeneracy through the total number of observed galaxies. We 

shall now consider the magnification effect on the distribution of galaxy images. We do 
not use the magnification effect locally as proposed by Broadhurst et al. (1995) and 
Broadhurst (1995) in order to derive the local mass density from it, but we use the 
'global' magnification effect, i.e., we compare the total number of observed galaxies with 
the expected (unlensed one), and relate that to the mean mass density inside the observed 
field. 

In the above subsection we have not used the number density no of galaxies observ- 
able in the absence of lensing. If we use this, then we can calculate from Eq.(5.6) the 
expected number (N) (R) of galaxies as a function of the mean mass density R assumed 
in the mass reconstruction, 

(N)(R) = U n G (^-\0,z,R)) zM (5.12) 

and compare that with the number N of galaxies actually observed. We compare the 
deviation in terms of the standard deviation a = a/ (N) and obtain 

The value R for which x 2 (k) shows a minimum (in fact, at which x 2 = 0) gives the most 
probable value for the mean surface mass density of the cluster. 
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We note that in the case of a single redshift of the sources (see Sect. 4.1) the de- 
termination of the mean mass density is even simpler: the mass reconstruction is done 
according to (4.4) with an assumed value for K, say K = 0. From the resulting mass 
and shear maps k(0) and 7(0), one can calculate ( / u Q_1 (0)) w . Because of Eq. (5.6) we 
can relate that to the true magnification and from Eq. (5.10) we conclude that 



N 



Un = = <^ue 1 (0)> w = A 2 " 2 " (r°-\0)) u • (5.14) 

Thus, we can immediately calculate A from (5.14), and with the invariance transformation 
(4.5) the true mass density [1 — Ktrue(0)] A = 1 — k(0). 

In Sect. 6.2, we apply this method to break the mass invariance to a numerically 
simulated cluster (z^ = 0.4), for the case that the sources are distributed in redshift. We 
demonstrate that the method yields good estimates for the mean surface mass density, 
i.e., it yields good estimates on the total mass inside the observed field. However, we 
point out that the method is not applicable in the case that a ~ 1, because then no 
magnification bias or antibias occurs, and the observed number density of galaxy images 
provides no information on the mean mass density R in the field. 

We note that the method suggested by Bartelmann & Narayan (1995) still works 
in this case: they made use of the magnification of individual galaxy images, at fixed 
surface brightness which is known to be unchanged by gravitational light deflection. 
Their method will become extremely useful once sufficient HST data become available 
to obtain the mean size-surface brightness relation for these faint galaxies. 



6 Application to simulated data 

For testing the reconstruction technique we use the same numerically modelled cluster 
as in Paper II. Using its surface mass density shown in Fig. 2, we generate a distribution 
of synthetic galaxy images using the source ellipticity distribution 

»fel> = ,g[l-Jp(-l/JP)] e *p(-|^/fl 2 ) -th B = 0.15, (6.1) 

and the redshift-distribution (3.9) with /3 true = 1 and (z) true = 1. The galaxies are 
randomly distributed in the source plane, and the number density of galaxy images is 
^gai = 60/arcmin 2 . The cluster is placed at a redshift z d = 0.4 so that (w) = 0.44, 
(w 2 ) = 0.25 and / = (w 2 ) / (w) 2 = 1.29. 

6.1 The derived mass distribution for different assumed redshift 
distributions of the sources 

We show in Fig. 3 the resulting surface mass density map for R = Rt ru e = 0.12, (3 = 
Arue = 1 and (z) = (z) true = 1, i.e., the assumed redshift distribution is the true one. 
We find a good agreement with the original mass density displayed in Fig. 2. Compared 
to the mass reconstruction shown Paper II - where all sources were at the same redshift 
- the noise is increased due to the dependence of the lensing strength of the cluster on 
the redshift of the sources. 
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However, in practice we do not know K true and we therefore choose R such that the 
minimum of the - locally smoothed - reconstructed surface mass density is zero. The 
reason for not requiring the minimum of the reconstructed mass map to be zero but that 
of the locally smoothed mass density (gaussian smoothing function with smoothing length 
of 1') is that the former quantity is much more affected by noise. Choosing R in that 
way leads to a good reconstruction only if the minimum of the cluster mass distribution 
is indeed close to zero, what is valid in practice if the observed field is several Mpc wide. 

With that procedure we obtain almost the same mass density as that shown in Fig. 3 
and the mean mass density detected inside the field is R = 0.133. 




Fig. 2. The surface mass density distribution of the numerically modelled cluster. The sidelength is 
about 10' corresponding to 3.8 Mpc for Hq = 50km/s/Mpc and an EdS universe. The levels of the 
contourlines are 0.1, 0.2, . . . , 1.3. 




Fig. 3. The reconstructed mass density obtained solving Eq. (4.9) iteratively, assuming that the redshift 
distribution is known and assuming R = Strue = 0.12. We obtain almost the same result if we choose R 
such that the minimum of the locally-smoothed mass density is zero. This assumption leads to R = 0.133. 
The levels of the contourlines are the same as in Fig. 2 
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In Fig. 4 we show the mass distribution obtained using (z) = 1.5 and (3 = 2. The 
mean mass density R = 0.098 is chosen such that the minimum of the locally-smoothed 
mass density is zero. Overestimating the mean redshift of the galaxies leads to an un- 
derestimation of the overall mass distribution and vice versa. This is shown in Fig. 5, 
where we use (z) = 0.6 and (3 = 3 for the reconstruction and obtain a mean mass density 
of R = 0.228. Common to the Figs. 3 to 5 is that the substructure is nicely recovered, 
regardless of the assumed redshift distribution. 

In Fig. 6 we show the mean mass density obtained for different assumed redshift 
distributions. We find that this is approximately proportional to 1/ (w), i.e., it is inversely 
proportional to the assumed mean effective lensing strength. 




Fig. 4. The reconstructed mass density obtained solving Eq. (4.9) iteratively assuming that the redshift 
distribution given through (3.9) with (z) = 1.5 and (3 = 2. The true redshift distribution is described by 
true — 1 and /3true — 1- For the mean mass density we obtain k — 0.098. The levels of the contourlines 
are the same as in Fig. 2 



6.2 Breaking the mass invariance in practice 

We now use Eq. (5.12) to break the mass invariance. First we choose the parameter 
a describing the number counts versus flux of the faint galaxies. Again, the source 
ellipticity- and redshift distribution is given through (3.9) and (6.1) with parameters as 
given in Table 1. The spatial distribution of the galaxy images is given through Eq. (5.5), 
where (/U a-1 (0, z)} is calculated from integrating the 'true' local magnification weighted 
with the redshift distribution of the sources. We keep the number of galaxy images (lens 
plane) fixed and use N = 5260 galaxy images for our simulations. 

Next we perform the mass reconstruction according to Eq.(4.9) for different values 
of R. From the resulting mass and shear map we calculate the map ( / u a_1 (0, z, «)) , 
where we assume to know the true redshift distribution of the sources and the true slope 
a. Finally, we derive from Eq. (5.12) the number of expected galaxies for the assumed 
value of R and calculate from that with Eq. (5.13) the function x 2 (k). The resulting 
function x 2 (k) is shown in Fig. 7 for different values of the parameter a. 

We find that all curves x 2 (k) show a minimum close to Rtme = 0.12, but the minima 
are narrower the more the slope a deviates from one. This is because the larger |1 — a\, 
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Fig. 5. Same as Fig. 4 but assuming that the redshift distribution is given through (3.9) with (z) = 0.6 
and = 3. For the mean mass density we obtain R = 0.228. The levels of the contourlines are the same 
as in Fig. 2. Note that the scale of the z-axis is increased in this figure 



0.25 - 



I | I I I I | I I I I | I I I I | I I I I | I I I 



0.2 



0.15 - 



0.1 



0.5 



1 

<z> 



1.5 



0.2 0.3 



0.4 0.5 0.6 

<w> 



Fig. 6. The mean mass density R averaged over the data field U as a function of the assumed redshift 
distribution of the sources. The mean mass density is obtained such that the minimum of the locally 
averaged mass density is zero. For redshift distributions with (3=1 we use crosses, for (3 = 1.5 triangles 
and for (3 = 3 dashes. The left panel shows the mean mass density obtained from the reconstruction 
as a function of the mean redshift (z), the right panel as a function of (w) defined in (A2.1). For 
producing the synthetic galaxy images we use for their redshift distribution (z) t = 1 and fitrue = 1, 
i.e., (w) = 0.44. The true mean mass density of the cluster is Ktrue = 0.12 



the stronger the magnification bias (or anti-bias) and the stronger the dependence of 
(N) (R) is on R. Hence, for the same number density of galaxy images, the significance of 
rejecting values of R increases with increasing |1 — a\. From our simulations we find that 
the mean surface mass density is 0.09 < R < 0.16 (2a level) for a = 0.25, 0.06 < R < 0.18 
for a = 0.5 and < R < 0.23 for a = 0.75. The true value of the mean surface mass 
density is Rtme = 0.12. Increasing the number density of the observed galaxy images 
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Fig. 7. The x 2 («) 

as a function of the assumed value of the mean mass density re used for the mass 
reconstruction. We use different parameters a to describe the dependence of the number counts on the 
flux (F(> S) oc S~ a ) of the faint galaxies. We assume to know the redshift distribution of the sources 
and the parameter a. The true values of the mean mass density IS /^true 

= 0.12 

improves the significance levels, because the x 2 function is roughly proportional to the 
number of observed galaxies. We note that the galaxies used for determining R can be 
different from that used for the mass reconstruction, since the shape of the galaxies has 
not to be measured for the former purpose, increasing the number density of available 
galaxy images considerably. A number density of n ga i = 100/arcmin 2 would reduce the 
2a confidence levels to (0.08, 0.17) for a = 0.5 and to (0.10, 0.15) for a = 0.25. 

In practice, the redshift distribution of the sources is poorly known and the mean 
redshift (z) can be over- or underestimated in a cluster mass reconstruction. If the true 
mean redshift is (z) tTue = 1 and if we assume for the reconstruction (z) = 0.8 [1.5], then 
we obtain the 2a confidence limits k G (0.11,0.23) [k G (0.08,0.16)]. To derive these 
limits we used a slope a = 0.5 for the number counts of the faint galaxies. 



7 Discussion 

The present paper should be considered as a further step in building up the ground for 
nonlinear cluster reconstruction techniques. In contrast to Papers I & II we have taken 
into account a redshift distribution of the sources. Our main results can be summarized 
as follows: 

(1) We have related the local expectation values of the moments e n of the image 
ellipticities to the local lens parameters 7 and k in Eq. (3.6). These moments depend 
in a complicated way on the redshift distribution of the sources as shown in Eqs. (3.6) 
and (3.7). In Sect. 4.3 we have shown that in the case of non-critical clusters the local 
expectation values (e n ) depend only on the local lens parameters 7 and k and on few 
moments (w k ) of the effective distance w of the sources from the lens, weighted with the 
redshift distribution of the sources [see Eqs. (4.12), (4.15), and (4.16)]. 

(2) In Sect. 4.2, we have generalized the inversion method developed in Paper II such 
that the redshift distribution of the sources is accounted for. This is important if the 
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cluster is at a high redshift (z^ ^ 0.2), because then the effective lensing strength for 
sources with z s ~ 0.5 is significantly different from that for sources with z s ~ 1. For 
the reconstruction the redshift distribution of the sources has to be assumed and the 
inversion equation is solved iteratively. In Sect. 6 we have applied the reconstruction 
technique to synthetic data and shown that the reconstructed mass density is in good 
agreement with the original one. Compared to the mass reconstruction in the case of a 
single source redshift, the noise is slightly increased if the same number density of galaxy 
images is used. 

(3) For non-critical clusters we have simplified the inversion method in Sect. 4.3 
such that only one integration is necessary. Then the inversion equation is very similar 
to that developed previously for a single redshift of the sources (see Sect. 4.1). For the 
reconstruction of a non-critical cluster, one only has to assume the first two moments 
(w) and (w 2 ) of the effective distance w weighted with the redshift distribution. The 
weaker the cluster, the less information on the redshift distribution has to be available 
for the reconstruction of the mass distribution. For a weak cluster it is sufficient to know 
the first moment (w). 

(4) We have shown that the mean surface mass density R is still a free variable in 
the inversion equation (4.9), i.e., there still remains a global invariance transformation 
for the surface mass density field which leaves the field of (e) unchanged. For non-critical 
clusters, this invariance transformation is given explicitly in Eq. (4.20) and it is similar 
to that derived previously for a single redshift of the sources [see Eq. (4.5)]. 

(5) In Sect. 5 we have discussed some ideas to break the mass invariance: maximizing 
the likelihood for the observed image ellipticities, using the second moments (e 2 ) of image 
ellipticities or the magnification effect on the position of the galaxy images. For all these 
cases, the mass invariance is broken only in the presence of a redshift distribution of the 
sources. We find that the invariance is broken too weakly to make use of it in practice. 

(6) Using the magnification effect on the number density of the galaxy images (see 
Broadhurst, Taylor & Peacock 1995) is the most promising way to break the mass de- 
generacy. In contrast to Broadhurst et al. (1995) and Broadhurst (1995), we do not use 
the observed local number density to derive the local surface mass density (because this 
local estimate has a considerably larger noise) but we use the total number of observed 
images to determine the mean mass density: we assume a value for R, perform the mass 
reconstruction, then calculate the local magnification and from that the number of ex- 
pected galaxy images in the field. Comparing this with the number of 'observed' galaxy 
images via a x 2 -analysis gives a confidence interval for R. This method works only if 
the distribution F(> S) oc S~ a of the unlensed sources with flux larger than S has a 
slope a 7^ 1, because for a = 1 the number density of lensed objects is not changed 
compared to the unlensed one. The confidence interval obtained for R is the narrower, 
the larger \a — 1| is, because the magnification bias (anti-bias) increases with increasing 
a > 1 (decreasing a < 1). 

(7) In Sect. 6 we have used the same numerically modelled cluster as in Paper II for 
testing the reconstruction technique. We reconstruct the mass density for the case that (i) 
the assumed redshift distribution is the true one, (ii) the mean redshift is overestimated 
and (iii) underestimated. The mean mass density R is adjusted such that the minimum 
of the reconstructed mass density is zero. We find that the substructure of the cluster is 
nicely recovered regardless of the assumed redshift distribution. The total mass detected 
is within 10% of the true mass for (i), underestimated for (ii) and overestimated for (iii). 
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Using the method as describe above in (6), we find that reliable limits on the mean mass 
density can be derived with a significance increasing with increasing number density of 
galaxy images and increasing \a — 1|. We note that for a successful application of this 
method on real data it is essential (a) to know the number density of the unlensed sources 
which would have been detected in the absence of lensing but under the same observing 
conditions, (b) to choose a colour which gives an appropriate slope of the faint galaxies 
in flux (]ck — 1 1 large !) and (c) to know the mean redshift of the sources considered. 

Finally, we should mention that although the cluster reconstruction method devel- 
oped in this and our earlier papers is considerably more complicated than straight appli- 
cation of the original Kaiser & Squires (1993) method, these modifications are essential 
if applied to WFPC2 observations of a cluster center. For the one case we considered, 
i.e., the cluster CI 0939+4713 (Seitz et al. 1996), the small field-of-view of the WFPC2, 
the lensing strength of the cluster center, and the fairly high redshift z& = 0.4 of the 
cluster make it absolutely necessary to apply an unbiased finite-field inversion technique, 
to use a non-linear reconstruction method, and to account for a redshift distribution of 
the galaxies. 
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Appendix 1 

Let us derive Eq. (3.5), first considering the case \g(z)\ < 1. We define e s = yexp(2i(p) 
and u = exp (2itp), so that d<p = —i du/u, use Eq. (3.4b) and begin with the ^-integration: 
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In the last step, we used that the integrand has one pole at u = where the corresponding 
residuum is [—g(z)] n ; the other pole at u = l/(g*y) lies outside the unit circle and thus 
does not contribute. As a result we obtain for \g(z) \ < 1, 
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Similarily, we derive for \g(z)\ > 1: 
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since the pole at u = g/y lies again outside the unit circle; we thus find that for \g(z)\ > 1, 
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Appendix 2 



Here we derive approximations for X\(k) and X 2 (k) for the case of non-critical clusters. 
This non-criticality implies that 1 -to > 0. We can expand the integral in (4.10), using 
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to obtain 
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It is easily seen from (4.10) that for n > 1 the following relation holds: 
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As it turns out, whereas the power series in (A2.2) converges, a Pade approximation 
behaves much better. From (A2.2) and (A2.3), we obtain for the lowest-order approxi- 
mation (see e.g. Press et al. 1992, page 194 ff.) 
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and for the next higher order, 
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In Fig. 8, left panel, we compare Xi(k) with the approximations (A2.4) and (A2.5), 
and the expansion (A2.2) up to the term k = 2 for a cluster with Zd = 0.4 and the 
redshift distribution shown in Fig. 1 (i.e., for which (z) = 1). We find that the approxi- 
mations (A2.4) and (A2.5) are excellent for k ^ 0.6, good for 0.6 ^ k ^ 0.8, and clearly 
deteriorate for k ^ 0.8. As expected, approximation (A2.5) is always better than (A2.4). 
In Fig. 8, right panel, we compare X 2 {k) with the approximations given above. We find 
qualitatively the same result, but the deviation of X 2 (k) from the approximations (A2.4) 
and (A2.5) becomes important for smaller values of «, k ^ 0.6, say. We note that these 
approximations become even better if the difference between mean redshift of the sources 
and the cluster redshift increases. 
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Fig. 8. Comparing different approximations for X\(k) (left panel) and Xi{k) (right panel) as defined 
in Eq.(3.7) as a function of k for the redshift distribution p z {z) shown in Fig. 1. The dotted lines show 
the approximations given in Eq. (A2.4), the dashed one those of Eq. (A2.5) and the long dashed line the 
expansion given in Eq. (A2.2) up to k = 2 as an approximation for X\(k), and its first derivative as an 
approximation for ^(k) (X2 = —dX\/dn), respectively. The solid lines show the exact result for X\ 
and X2 obtained from performing the integration (4.10) 



As a result, we obtain that the approximation (A2.4) for X\(k) is sufficiently accurate 
over a wide range of k (if k approaches values close to 0.8, the cluster most likely is critical 
anyway). Therefore, one can use approximation (A2.4) for (most) non-critical clusters. 



Appendix 3 



In the case of a non-critical cluster (or at positions of a critical cluster where k + |7| < 1) 
the ratio 




(A3.1) 



becomes according (4.10) 

R = X 2 {k)/X 2 {k) . (A3.2) 

Therefore, for a fixed redshift distribution of the galaxies, the ratio R depends on k only. 
If we identify the expectation values again with the local averages, 



R 



(A3.3) 



a comparison of (A3. 2) with (A3. 3) can in principle determine the local value of k. In 
Fig. 9 we show R for a non-critical cluster as a function of k (solid line) for the redshift 
distribution shown in Fig. 1 (/? = 1 = (z)). The following approximations for R(k) can 
be derived from (4.11) and the relations in Appendix 2: 
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We obtain Eq. (A3. 4a) using the approximations (A2.4) for Xi(k) and X 2 (k). For 
Eq. (A3. 4b) we use the approximations (A2.5) for X\ and A 2 , and in the case of 
Eq. (A3. 4c) we use the approximation (A2.5) for X 2 and the approximation (A2.4) for 
Xi. We compare these approximations with R = X 2 {k)/X 2 (k) in Fig. 9: for the chosen 
redshift distribution, (A3. 4a) gives R ~ 1.29, approximation (A3. 4b) is shown as the 
dotted line and is obviously a good approximation for small k ^ 0.2, whereas approxima- 
tion (A3. 4c) is shown as the dashed line and fits well up to k ^ 0.8. For higher surface 
mass densities the cluster is most probably critical and Eq. (A3. 2) does not hold anyway, 
but has to be replaced according to Eq. (3.6); then, R becomes a function of 7 and k. 
Eq. (A3. 4c) can be inverted to obtain k as a function of the observable quantity R, 
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In Fig. 9 (right panel) we compare the surface mass density k derived from Eq. (A3. 5) 
with the true one and find indeed a good agreement for k ^ 0.8. 




Fig. 9. The solid line (left panel) shows the ratio R = | ^e 2 ^ | / | (e) 2 | as a function of the surface mass 

density k in the case of a non-critical cluster, for which R = X2(k) /X^(k). The dotted line shows the 
approximation (A3. 4b) and the dashed line (A3. 4c). We can invert (A3. 4c) to derive the approximation 
(A3. 5) for k. The resulting values k* are compared to the true one k in the right panel 
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From Eq. (A3. 5) one might conclude that k(6) can be directly obtained from observ- 
ing R{0) without using any of the inversion techniques derived in the past. Unfortunately 
this is not the case because R{0) can not be determined precisely enough from local ob- 
servations. To obtain a reliable estimate for a local surface mass density k ^ 0.4, one 
would need about 1000 galaxy images. 

Thus, we try to derive the mean surface mass density R by measuring R{0): assuming 
a value for R we perform the reconstruction according to (4.9) and derive from the mass 
and shear map the corresponding map of R(0), which then depends on the assumed value 
for the mean mass density R. Finally, we compare this with the measured R(0), i.e., we 

search for R which minimizes x 2 = / (j%(0) — R(0)^j \ , averaged over the data-field IA. 

Again, we find that the mean surface mass density can not be determined in that way, 
because the local mean image ellipticities (e) and (e 2 ) can not be derived with sufficient 
accuracy from a reasonable number density of galaxy images. 

We note that we tried to determine R in various other ways from moments of im- 
age ellipticities: assuming a value for R we performed the reconstruction and calcu- 
lated from the mass and shear map the local expectation values of (e n ) (R) according to 
Eq. (3.6). Then, we compared that with the 'measured' local means e n and minimized 
X 2 = |(e n ) (R) — £ n \u var ying the value R used for the reconstruction. All these attempts 
failed for resonable assumptions for the number density of galaxy images, showing that 
the image ellipticities provide not enough information to break the mass degeneracy in 
practice. 
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